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Abstract. A polynomial time algorithm which detects all paths and cycles 
of all lengths in form of vertex pairs (start, finish). 



Introduction 

Problem of path/cycle existence and finding is a big topic in Discrete Mathe- 
matics and Computer Science [Il[l[lll[51[Hl[l[Hl[ini[ni[IIl[Il[IllIil[ini[ni 

and others]. This article is not a review but just another design. 

The design's idea may be traced back to Sir William R. Hamilton, [1]: "I have 
lately been led to the conception of a new system, or rather family of systems, of 
non-commutative roots of unity, ... which admit, even more easily than the quater- 
nion symbols do, of geometrical interpretation. ... every one of the results may be 
interpreted, as having reference to the passage from face to face (or from corner to 
corner) of the icosahedron (or of the dodecahedron) ... ." Let us rephrase: a formal 
system was what allowed tracking of the "passages" . 

This work models a graph's walks with a generative grammar and applies dy- 
namic programming to detect paths/cycles. Some deviation from the widely ac- 
cepted terminology is due to the grammar: walks are just sequences of adjacent 
arcs, arcs/loops are walks/paths/circuits/cycles of length 1, etc. Author hopes that 
this will not be a nuisance. Also, any graph becomes a multi graph when talking 
walks. So, let us start with that from the very beginning. 

1. Walk generator 

Let g = {V,A) be a given multi digraph (possible with loops): V is the vertex 
set and A is the arc set of the graph. Let's arbitrarily enumerate vertices: 

V = {vi,V2, ■ ■ ■ ,Vn}, 

- and build a set matrix S = {sij)nxn whose (z, j)-element is the set of all arcs 
which start in vertex Vi and finish in vertex vj; some of the elements can be 0. 
Let's define powers of matrix S: 

S-i = S, 8''+^ = S'^S, k>l. 

The multiplication of set matrices is performed by the rules of the usual matrix 
multiplication but with replacement of the numeric products with Cartesian prod- 
ucts of sets and the numerical sum with set joins: if X = {xij)axb £ind Y = {yij)bxc 
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are set matrices, then 

b 

XY = {[J Xii3 X yi3j)axc- 
13=1 

Let us mention that the set matrix multiphcation is an associative operation. For 
matrix 5*^, the last formula gives: 

n 

(1.1) = S, 5^+1 - ( y iS%f3 X iS)0,)n-xn, k>l. 

13=1 

Here and further, symbol {M)ij means (i, j)-element of matrix M. 

Matrix S" is a walk generator: element {S'^)ij consists of all such walks "written" 
in arcs, which have length k, start in vertex Vi, and finish in vertex Vj] all circuits 
of length k are assembled in diagonal elements. That may be cleared with the 
following decomposition: 



13=1 



U ( U X {S),0) X {S)p, = U [S'^-^, X {S)^p X {S)p, =...= 

0=1 7=1 /3,7=1 
(1-2) — [J('S')(Ti=i,CT2 X (S')o-2CT3 X . . . (S')o-j._io-j. X (S) ,a k + i=j ^ 

(7 

- where join is taken over all samples <j ~ {ai, tT2, . . . , crfc+i) of fc + 1 numbers 
constrained with the following: 

(1.3) CTi i; 1 < cr„i < n, m = 2, 3, fc; ak+i = ]■ 

Let (<S'p^^)y be a limitation of ioin [TT2l to those samples a which are permutations; 
and [S^^)ii be a limitation of join 11.21 to those samples a which are circular 
permutations. 

Theorem 1.1. In multi digraph g: 

1. There is a walk of length fc > 1 from Vi into Vj iff {S'^)ij ^ 0; 

2. There is a path of length fc > 1 from Vi into Vj iff {Sp)ij ^ 0; 

3. There is a cycle of length fc > 1 attached to Vi iff {Sq)^ ^ 0; 

J^. There are paths and cycles of length 1 iff appropriate elements of walk generator 
S ~ are not empty. 

Exercise 1.2. Suppose g has two vertices vi, V2 and four arcs an, ai2, 021, and 
a22 : arc begins in vertex Vi and ends in vertex Vj . Then, the walk generator for 
g is the following set matrix: 

{an} {ai2} 
{a2i} {022} 

The arcs/loops are walks of length 1. Arcs ai2 and 021 are paths of length 1. Loops 
an and 022 are cycles of length 1. 

Walks of length 2 constitute elements of set matrix S'^: 

{onan, 012021} {011012,012022} 
{021011,022021} {021012,022022} 



= S ■ 



= SS^ 
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There are no paths of length 2. Diagonal elements are chxuits of length 2. Among 
them is cycle (012021) — (021012). It is a Hamiltonian cycle. 

Walks of length 3 constitute elements of set matrix 5*^ = S'^S, and so on. 

Iterations II . II create a language - we may call it an arc language. Its alphabet is 
A, and its grammar is a context-sensitive L-system whose axiom is walk generator 
S and whose production rules, in essence, consist of omitting the curly braces in 
formula ll.il and replacing with empty string. The k-th generation of the system 
consists of all walks/strings of length k written in arcs. The walks are sorted over 
vertex pairs (start, finish). 

The exact numbers of /c- walks in these vertex pairs (start, finish) are elements 
of fc-th power of the multi digraph's adjacency matrix. From formula 11.2) a rough 
upper bound for the numbers is 

|y|^-i(max|5,,|)'=. 

The bound is reachable in complete graphs. General walk pattern is a path with 
cycles attached, and multiplicand is due to the branching. Nevertheless, 

using formula ll.ll can be an efficient method for particular problems/graphs - sparse 
digraphs, trees, and others with simple walk pattern. 

Exercise 1.3. In the example from exercise 11.21 there are 2'^+^ walks of length 
k, fc > 1, because there is cycle 012021 with two cycles/loops attached to it - on 
and 022. Written in arcs, the pattern of walks from Vi into Vj is a self-similar 
structure described by the following grammar: 

- where symbol [x\ means zero or more repetitions of x. 

For the path/cycle problem, the algorithmic simplicity of formula 11.11 looks 
tempting: all paths/cycles could be found with ffitering of all sets {S'')ij. Al- 
though those sets have exponential power, filtering per se could be made efficient 
due to a possibility of sorting/ffitering the sets during iterations 11.11 That which 
makes the method inefficient is a need for calculation/remembering of exponential 
size sets - elements of . The problem can be tried and fought with walk coloring. 

2. Walk coloring 

Walk colors are a semantics of the arc language defined by formula 11.11 Col- 
oring can be done with an arc coloring; a function defined on walks; and with 
other means, as well. For example: adjacency matrix is a coloring with a "number 
of walks" function; transitional diagram is a digraph arc-colored with the events 
triggering transitions; Markov chain is a digraph arc-colored with the transitional 
probabilities; the Icosian Calculus [Tl[2] is the icosahedron colored with the Hamil- 
ton's symbols, etc. The original walk generator defined in section [T] is a coloring 
appropriate to the identity function on set of all walks. Its opposite is a monochro- 
matic coloring - all walks have the same color. 

Exercise 2.1. In the example from exercise 11.21 let's color the arcs starting in vi 
with color "Head" and the arcs starting in V2 with color "Tail". Also, let's use the 
string presentation of iterations 11.11 described in that section. Then, the example 
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will become an algorithmic model of coin flipping: 

\ T T ) ' \ TH, TT TH, TT 

Changing of the colors to "0" and "1" will make the example a generator of 
all/random finite (0, l)-sequences. 

Exercise 2.2. Let P be the following monochromatic arc coloring of g: 

^^^'^ - \ 0, (5)., = ■ 
Then, number of different Hamiltonian cycles in g is 
1 



71 



IJ2(T3 ■ ■ ■ 



where sum is taken over all circular permutations a — (tri, (T2, . . . , (T„) of numbers 
l,2,...,n. 

In the monochromatic case, color is just a sign of adjacency. The coloring can be 
modeled with Boolean matrix - a matrix filled with true and false (for 0). Then, 
iterations 11.11 can be replaced with formulas for powers of that Boolean matrix. 
The powers are calculated with the algorithm for numeric matrix multiplication 
except the products are replaced with conjunctions and the sums are replaced with 
disjunctions: ii X — {xij)axb and Y — {yij)bxc are Boolean matrices, then 

b 

(2.1) XY = {y x^p Ayi3j)axc- 

13=1 

Let us mention that the Boolean matrix multiplication is an associative operation. 
Boolean arc coloring is as follows: 

true, {S)ij ^ 
false, {S)ij = 

And Boolean walk coloring is defined with the following iterations: 



(2.2) 



(B'h = { 



(2.3) 3"+' = ( V iB%3 A {B')0j)nxn, k>l. 

The true/ false coloring is efficient for the reachability problems: formula 12 . 31 finds 
all pairs of connected vertices with {2n — l)n^ operations "V" and "A" at most. 
To prove that, let's perform decomposition of {B^)ij exactly like for formula ri.2l 

(2.4) (B^-+i),, = \J{B%,^,^,, A (i?i),,.3 A . . . (i?i).,_,., A {B^) 



- where disjunction is taken over all number samples a which satisfy constrains 1 1.31 
Let {Bp'^^)ij be a limitation of disjunction 12.41 to those samples a which are 
permutations; and {Bj^^^)ii be a limitation of disiimction 12.41 to those samples a 
which are circular permutations. Then, theorem 11.11 implies 

Theorem 2.3. In multi digraph g: 

1. There is a walk of length k > 1 from Vi into vj iff {B^)ij = true; 

2. There is a path of length k > 1 from vi into Vj iff {Bp)ij = true; 

3. There is a cycle of length k > 1 attached to Vi iff {BQ)ii — true; 
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4- There are paths and cycles of length 1 ijf appropriate elements of Boolean matrix 
are true. 

Although Boolean coloring 12.21 12.31 is inefficient for the path/cycle problem in 
multi digraph, it still can be efficient for other particular problems. 

Exercise 2.4 (Shortest Path). Length of the shortest path from vertex Vi„ into 
vertex Vj^ can be calculated with formula 12.31 by multiplication of iQ-th string of 
Boolean matrix B^ on whole Boolean matrix B^ until element {B^'^^)i^j^ will be- 
come true, k — 1,2, ... ,n — 2. Then, the shortest path has length k + 1. The table 
below compares the method with classical routing algorithms: 



Algorithm 


Formula [231 


Bellman-Ford 


Dijkstra 


Time complexity 


o{\vn 


od^ll^l) 


om^ + \A\) 



Let us emphasize the algorithmic simplicity of formula l2.3l and the potential of that 
formula for further computational simplifications. 

Exercise 2.5 (Maximum Clique). Let Q be the following Boolean matrix: 

_ r (Si)y A (Bi),,:, z > J 



("^^'^ ^ "I false, z < J ' 

- where B^ is defined with formula [2?2l Let's define with the following iterations: 

Qi = Q, Q'»+i = Q"Q A Q"\ m > 1 

- where product of Boolean matrices is defined with formula 12. li and conjunction 
of Boolean matrices is a matrix of conjunctions of appropriate elements. It may be 
seen that maximum clique in g has the following size: 

q = niin{fc | Q'' = ifalse)nxn}. 

Let us clarify that any vertex is a clique of size 1 and sketch a proof: 

Sketch of proof . Boolean matrix Q defines a subgraph of multi digraph g. This 
subgraph is a directed forest. All walks in this forest are paths. Boolean matrix 
Q'' rids that forest's arc set of all arcs except those vertex pairs {vi,Vj) which are 
start and finish of paths of all length 1,2, ... ,k. Cliques of size fc + 1 are the only 
invariants under such depletion of g. □ 

For multi digraph, the path/cycle problem's semantics is "to visit vertices only 
once". That gives an idea of efficient walk coloring for the problem. The idea 
is to make from the walk colors a memory. It is an interesting question, what 
to remember. Specter of options stretches from remembering visited vertices only 
to remembering unvisited vertices only. Disconnected graphs make the strategy 
question non-trivial. Nevertheless, those extreme strategies are dual and both based 
on an innate knowledge ll.il Let's explore the rationalistic approach. 



3. Path problem 

Let's encode walks with the unvisited vertices. Two opposite strategies are 
possible: to remember arcs' start-vertices or to remember arcs' finish-vertices. The 
strategies are dual. They correspond to the ambiguity of words "visited vertex": 
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was it exited already or just entered? 

Arc coloring/coding with start-vertices is as follows: 

^^■'^ 0, (5)., =0 V * = j ' 

- where S the walk generator. Walk coloring with start-vertices is defined with the 
following iterations: 

(3.2) (^.-.i)^^_(U;=,(i^'=)..n(^V-, fc>i 

The formula may be seen as another set matrix multiplication. 

Let's call that coloring defined with formulas 13. II and 13.21 a F-coloring: all arcs 
which start from vertex Vi have the same color; the color is set V — {vi]\ the diagonal 
elements are rid of because they are loops/circuits; all walks from vertex Vi into 
vertex Vj have the same color; the color is set of all vertices which were missed by 
one or more walks from Vi into Vj - those vertices missed by a walk will extend that 
walk in the future when grammar II . II will allow that. To clarify that, let's perform 
decomposition of a non-empty set {F*'^^)ij: 



[3=1 

n n n 

= U { U n {F%p} n {F'),, = U {F'^-'h, n {f%^ n {f'),, = ...= 

13=1 7 = 1 13,1 = 1 

(3.3) = U(^')-i--2 n {F^)a,a, n . . . n {f^).,_,., n (i^^).,,.,+,=„ 

cr 

- where join is taken over all number samples a = (cti, (72, ... , crfe+i) which satisfy 
constrains 11.31 

That which makes _F-coloring special is the following inclusion: 

(3.4) {F%, C V. 

It will be shown that this inclusion makes the coloring an efficient one. 
Exercise 3.1. For the example from exercise [1.21 

{V2} 

{vi} 



_ ( 0n{i;2} u {1^2} n I 

' {wi}n0 u 0n{fi} 



Thus, due to formulaO F'^ = (0)2x2 for all powers fc > 2. 

Lemma 3.2. In multi digraph g, if there is a path of length k > I from vertex Vi 
into vertex Vj, then {F^)ij ^ 0. 

Proof. Suppose, vertices 

Vfj,i=i, Vfj_^ , . . . , W^fc , Vfj,^_^-^=j 

constitute a path of length k from Vi into Vj. Then, 

f/ifc+i i^v^^, m = 1,2, . . . ,fc. 
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Thus, by definition 13. II 

v^.,+, e ^ m = 1, 2, . . . , fc. 

Then, due to decomposition [3?3l 

□ 

Lemma 3.3. In multi digraph g, if (F^)ij ^ 0, then there is a path of length k 
from vertex Vi into vertex Vj . 

Proof. Let's use mathematical induction over k. 

For k = 1, by definition 13. 1[ inequahty {F^)ij ^ impHes 

(5)., ^ 0. 

Then, there is an ark a G The arc is a path of length 1. 

Let's assume that the lemma is true for k — I. 

Let fc = Z + 1. Due to presentation 13.31 inequality {F^)ij 7^ implies existence 
of such number sample ctq — (a*2, /^3, • • ■ , A^fc) that 

- (F%=^,,.^, n (^1)^,^3 n (Fi)^3^, n . . . n n (f1)^,^^,^,^, ^ 0. 

Then, due to definition 13. 1[ there are k arcs 

'^PmMm + l ^ + l 1 ?W = 1 , 2 , . . . , fc 

- where S is the walk generator of g. The arcs create a walk w of length fc from 
into Wji 

Let's notice that 

(f^-')mip. ^ (^').=m,M2 n (Fi)^,^3 n (^1)^3^, n . . . n (f1)^,_„;., 3 ^ 0. 

Thus, due to the induction hypothesis, the following sub-walk of walk w 

{Vpi=i, a,fj,-^p2 J f^P2 1 "^^2^3 J • ■ • I ^Mfc ) 

is a path of length fc — 1 from Vi^^-^ into v^^, . From the other side, 

(^'"')m2p.+i ^ iF')p,ps n (Fi)^3^, n . . . n (f1)^,_,,^, n iF')p„p,^,=, 3 ^ 0, 

as well. Thus, due to the induction hypothesis, the following sub-walk of walk w 
is a path of length fc — 1 from v^^ into Vf^^,^^^j. Thus, if 

Vi ^ Vj , 

then walk w will be a path of length fc from Vi into Vj . That last inequality follows 
from definition 13.21 □ 

Let us emphasize the role of grammar 1 1.11 on an example of a disconnected graph. 
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Exercise 3.4. Let's add to the digraph from exercise 11.21 vertex vz, arc 023 from 
V2 into V3, and isolated vertex V4. Walk generator for the resulting graph is 



( {an} {012} \ 

{021} {022} {023} 



V / 



There are three paths of length 1 in the digraph: (wiai2W2), (f2a2ii'i), and (i'2a23'^3) 
Also, there is one path of length 2 in the digraph: (f iai2f2a23^'3)- 
Arc f -coloring 13. II detects the 1-paths: 



{V2,V3,V4} 



{vi,V3,V4} 



\ 



Iterations 



{vi,V3,V4} i 





detect the 2-path and absence of the longer paths 

/ {v3,Vi} \ 



V 



j4x4- 



/c > 3. 



Together, lemmas [3.21 and [3731 constitute a theorem: 

Theorem 3.5. In multi digraph g, there is a path of length k > 1 from vertex 
into vertex Vj iff {F^)ij ^ 0. 



Let's estimate computational complexity of theorem 13.51 Due to inclusion 13.41 
arc i^-coloring 13.11 requires 0{n^) operations. And walk i^-coloring 13.21 requires 
0{kn'^) operations: 0{n?') operations at most to calculate each of elements 
{F"^)ij, m = 1,2, ... ,k. The total computational complexity of theorem 13.51 is 



Particularly, when k 
time 



0{kn^). 

1 , theorem 13.51 solves the Hamilton path problem in 
Oin'). 



Obviously, the results are true when an "arcs' finish-vertices" coloring is used 
instead of the "arcs' start-vertices" coloring described. A finish-vertices coloring 
can be done with the following G-coloring. Arc G-coloring is defined as follows: 



(3.5) 



V-{vj}, (5),, 7^0 M^j 
0, (5)y = V i = J ' 



- where S is the walk generator. Walk G-coloring is defined with the following 
iterations 



(3.6) 



(G^-'+i) 



0, i = j 



k>l 



FINDING PATHS AND CYCLES IN GRAPHS 



9 



4. Cycle problem 
Let us define a ff-coloring. Arc i/-coloring is as follows: 



(4.1) 



0, ' 



where S is walk generator 1 and walk i7-coloring is as follows: 



(4.2) 



fc > 1, 



- where set matrices F and G were defined in section [3l 
Exercise 4.1. For the graph from exercise [ 



Then, 



{vi,V3,Vi} 

{w2,f3,W4} {w2,W3,f4} 





/ {an} \ 



{a22} 



V 



V 



j4x4: 



A: > 3. 



/ {«2,W3,W4} I 

{W1,W3,W4} 



I 

Let's notice that non-empty elements of the coloring indicate all cycles in the di- 
graph: loops an, a22 and 2-cycle (■i;iai2W2a2iWi)- 

Lemma 4.2. In multi digraph g, let p be a path of length k: 
Then, 

"^/Jfe+l S (-^ )/JlM2 (-^ )m2M3 n . . . n (-F )tiktik + l — (-^ ')a'1M): + 1- 

Proof. Sec proof of lemma 13.21 and decomposition 13.31 □ 



Theorem 4.3. In multi digraph g, there is a cycle of length fc > 1 attached to 
vertex Vi iff {Il'')u ^ 0. 

Proof. For fc = 1, the theorem is true due to definition 14.11 Let fc > 1. 
Necessity. Let c be a cycle of length k attached to vertex Vii 

- where axy G {S)xy are arcs and 5* is the walk generator. Then the following walk 

(l^iai;ii'f^/iia/Ji/i2 ■ ■ ■ "^fJ-k-l) 

is a path of length fc — 1. Then, due to lemma 

t'M.-i e(F'=-i),^,_, ^0. 

On the other hand, due to definition 13.51 

v^,_, e(Gi)^,_,, ^0. 
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Thus, due to definition 14.21 
Sufficiency. Let 

Then, due to definition 14. 2 i there exists such couple of sets {F^~^)ix and {G'^)xi 
that 

(F^-i)„n(Gi)„^0. 

Then, due to theorem [33] and definition 13. 51 there exist a path of length fc — 1 from 
Vi into Vx and an arc from Vx into Vi. These path and arc constitute a cycle of 
length fc. The cycle is attached to vertex Vi. □ 

Let's estimate computational complexity of theorem 14.31 By definitions 13.51 
and due to inclusion 13. 4( 

{H%, C V. 

Thus, arc iJ-coloring 14.11 requires 0{n^) operations. And walk i7-coloring requires 
0(kn'^) operations: O(fcn^) operations to calculate F^~^\ 0{rv') operations to cal- 
culate and O(n^) operations at most to calculate each of n diagonal elements 
of . The total computational complexity is 

O(fcn^). 

Particularly, when k — n, theorem 14.31 solves the Hamilton cycle problem in time 
0{n^). But a simplification is possible. In this case, there is no need for the 
whole matrix F^~^ but only for any one of its strings. That allows solving the 
Hamiltonian cycle problem with theorem 14.31 in time 

0{n^). 

Obviously, i^-coloring and G-coloring may be swapped in theorem 14.31 

Conclusion 

The following polynomial time algorithm detects all paths and cycles of all 
lengths in form of vertex couples (start, finish): 

Step 1: Calculate set matrices , G^, and H^. Non-empty elements of 
matrices F^ and indicate 1-paths and 1-cycles appropriately; 

Steps 2 n: Calculate set matrices F'' and H'', k = 1,2, ... ,n. Non-empty 
elements of matrices F'^ and H'' indicate /c-paths and A;-cycles appropri- 
ately. 

The algorithm could be simplified, for example, with vertex aggregation. In the 
formulas above, the aggregation might be presented with appropriate box set ma- 
trices. The modification may be seen as an incorporation of the DJP algorithm's 
idea. 

Further time-simplifications could be achieved by mixing the described "unvis- 
ited vertices" walk coloring with a "visited vertices" walk coloring. 

Let us emphasize that the algorithm solves a decision problem "exist/not ex- 
ist" . Although, lemma 14.21 or a "visited vertices" walk coloring can be used for 
restoration/selection of particular paths/cycles. 
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